A Rhetorical Analysis Approach to Natural Language Processing

نویسنده

  • Benjamin Englard
چکیده

The goal of this research was to find a way to extend the capabilities of computers through the processing of language in a more human way, and present applications which demonstrate the power of this method. This research presents a novel approach, Rhetorical Analysis, to solving problems in Natural Language Processing (NLP). The main benefit of Rhetorical Analysis, as opposed to previous approaches, is that it does not require the accumulation of large sets of training data, but can be used to solve a multitude of problems within the field of NLP. The NLP problems investigated with Rhetorical Analysis were the Author Identification problem – predicting the author of a piece of text based on its rhetorical strategies, Election Prediction – predicting the winner of a presidential candidate’s re-election campaign based on rhetorical strategies within that president’s inaugural address, Natural Language Generation – having a computer produce text containing rhetorical strategies, and Document Summarization. The results of this research indicate that an Author Identification system based on Rhetorical Analysis could predict the correct author 100% of the time, that a re-election predictor based on Rhetorical Analysis could predict the correct winner of a re-election campaign 55% of the time, that a Natural Language Generation system based on Rhetorical Analysis could output text with up to 87.3% similarity to Shakespeare in style, and that a Document Summarization system based on Rhetorical Analysis could extract highly relevant sentences. Overall, this study demonstrated that Rhetorical Analysis could be a useful approach to solving problems in NLP.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ontology of Rhetorical Figures for Serbian

The paper presents RetFig, a formal domain ontology of rhetorical figures for Serbian. This ontology is one of the necessary steps in developing tools for Natural Language Processing in the Serbian language, especially for tools pertinent to discourse analysis, sentiment analysis and opinion mining. The RetFig ontology was developed taking into account a plethora of rhetorical figures in the mo...

متن کامل

The Rhetorical Parsing of Unrestricted Texts: A Surface-Based Approach

Coherent texts are not just simple sequences of clauses and sentences, but rather complex artifacts that have highly elaborate rhetorical structure. This paper explores the extent to which well-formed rhetorical structures can be automatically derived by means of surface-form-based algorithms. These algorithms identify discourse usages of cue phrases and break sentences into clauses, hypothesiz...

متن کامل

Characterizing In-Text Citations Using N-Gram Distributions

Introduction This article focuses on a Natural Language Processing (NLP) approach for the analysis of citation functions in scientific papers. Bibliometric studies traditionally rely on citation metadata and count the number of times a publication has been cited. However, some recent studies rely also on full text processing on papers, e.g. (Boyack et al., 2013), (Bertin et al., 2013, 2014). Th...

متن کامل

Discoursal Analysis of Rhetorical Structure of an Online Iraqi English Newspaper

Abstract Rhetorical structure is helpful in improving how the writers maintain cohesion in their writings. This study examines how the Iraqi writers maintain cohesion in the text by analyzing the various rhetorical moves in Azzaman, an online Iraqi newspaper. To this purpose, twelve opinion articles from Azzaman Iraqi newspaper, published from January 2013 to June 2013 were analyzed. The findin...

متن کامل

Metadiscourse strategies in Persian research articles; Implications for teaching writing English articles

In order to develop an understanding of the rhetorical conventions in the Persian language and to find out the metadiscursive cultural norms of Iranian writers in their native language writings, it is necessary to probe into the implicit rhetorical features of academic writing which has so far eluded a comprehensive systematic characterization. Metadiscourse marking, which is supposed to be one...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1301.3547  شماره 

صفحات  -

تاریخ انتشار 2013